|
Sliced inverse regression (SIR) is a tool for dimension reduction in the field of multivariate statistics. In statistics, regression analysis is a popular way of studying the relationship between a response variable ''y'' and its explanatory variable , which is a ''p''-dimensional vector. There are several approaches which come under the term of regression. For example parametric methods include multiple linear regression; non-parametric techniques include local smoothing. With high-dimensional data (as ''p'' grows), the number of observations needed to use local smoothing methods escalates exponentially. Reducing the number of dimensions makes the operation computable. Dimension reduction aims to show only the most important directions of the data. SIR uses the inverse regression curve, to perform a weighted principal component analysis, with which one identifies the effective dimension reducing directions. This article first introduces the reader to the subject of dimension reduction and how it is performed using the model here. There is then a short review on inverse regression, which later brings these pieces together. ==Model== Given a response variable and a (random) vector of explanatory variables, SIR is based on the model where are unknown projection vectors. is an unknown number (the dimensionality of the space we try to reduce our data to) and, of course, as we want to reduce dimension, smaller than . is an unknown function on , as it only depends on arguments, and is the error with and finite variance . The model describes an ideal solution, where depends on only through a dimensional subspace. I.e. one can reduce to dimension of the explanatory variable from to a smaller number without losing any information. An equivalent version of is: the conditional distribution of given depends on only through the dimensional random vector . This perfectly reduced vector can be seen as informative as the original in explaining . The unknown are called the ''effective dimension reducing directions'' (EDR-directions). The space that is spanned by these vectors is denoted the ''effective dimension reducing space'' (EDR-space). 抄文引用元・出典: フリー百科事典『 ウィキペディア(Wikipedia)』 ■ウィキペディアで「Sliced inverse regression」の詳細全文を読む スポンサード リンク
|